Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 124663 |
| Missing cells | 12363 |
| Missing cells (%) | 0.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 13.7 MiB |
| Average record size in memory | 115.4 B |
Variable types
| Numeric | 11 |
|---|---|
| DateTime | 1 |
| Categorical | 17 |
BARRIO has a high cardinality: 408 distinct values | High cardinality |
UNIDAD_ESPACIAL has a high cardinality: 278 distinct values | High cardinality |
TIPO_CONDUCTA has a high cardinality: 111 distinct values | High cardinality |
CRIMEN_ID is highly correlated with AÑO | High correlation |
AÑO is highly correlated with CRIMEN_ID | High correlation |
EDAD_VICTIMA is highly correlated with GRUPO_ETARIO_VICTIMA_num | High correlation |
GRUPO_ETARIO_VICTIMA_num is highly correlated with EDAD_VICTIMA | High correlation |
CRIMEN_ID is highly correlated with AÑO | High correlation |
AÑO is highly correlated with CRIMEN_ID | High correlation |
EDAD_VICTIMA is highly correlated with GRUPO_ETARIO_VICTIMA_num | High correlation |
GRUPO_ETARIO_VICTIMA_num is highly correlated with EDAD_VICTIMA | High correlation |
CRIMEN_ID is highly correlated with AÑO | High correlation |
AÑO is highly correlated with CRIMEN_ID | High correlation |
DIA_SEMANA is highly correlated with DIA_SEMANA_num | High correlation |
AÑO is highly correlated with CRIMEN_ID | High correlation |
TIPO_DELITO is highly correlated with EDAD_VICTIMA and 10 other fields | High correlation |
ZONA is highly correlated with COMUNA | High correlation |
EDAD_VICTIMA is highly correlated with TIPO_DELITO and 4 other fields | High correlation |
TIPO_ARMA is highly correlated with TIPO_DELITO and 4 other fields | High correlation |
TIPO_LESION is highly correlated with TIPO_DELITO and 3 other fields | High correlation |
DISTANCIA_ESTACION_POLICIA_CERCANA is highly correlated with LONGITUD and 1 other fields | High correlation |
GENERO_VICTIMA is highly correlated with TIPO_DELITO and 5 other fields | High correlation |
MEDIO_TRANSPORTE_VICTIMA is highly correlated with TIPO_DELITO and 3 other fields | High correlation |
LONGITUD is highly correlated with DISTANCIA_ESTACION_POLICIA_CERCANA and 1 other fields | High correlation |
LATITUD is highly correlated with DISTANCIA_ESTACION_POLICIA_CERCANA and 1 other fields | High correlation |
TIPO_DELITO_ARTICULO is highly correlated with TIPO_DELITO and 8 other fields | High correlation |
MEDIO_TRANSPORTE_VICTIMARIO is highly correlated with TIPO_DELITO and 3 other fields | High correlation |
CRIMEN_ID is highly correlated with AÑO | High correlation |
COMUNA is highly correlated with ZONA and 2 other fields | High correlation |
DIA_SEMANA_num is highly correlated with DIA_SEMANA | High correlation |
MES_num is highly correlated with MES | High correlation |
GRUPO_ETARIO_VICTIMA_num is highly correlated with TIPO_DELITO and 5 other fields | High correlation |
ESTADO_CIVIL_VICTIMA is highly correlated with TIPO_DELITO and 4 other fields | High correlation |
GRUPO_ETARIO_VICTIMA is highly correlated with TIPO_DELITO and 5 other fields | High correlation |
COMUNA_num is highly correlated with COMUNA and 1 other fields | High correlation |
ESTACION_POLICIA_CERCANA is highly correlated with TIPO_DELITO and 4 other fields | High correlation |
MES is highly correlated with MES_num | High correlation |
TIPO_DELITO_ARTICULO is highly correlated with TIPO_DELITO and 1 other fields | High correlation |
ESTACION_POLICIA_CERCANA is highly correlated with COMUNA and 1 other fields | High correlation |
TIPO_DELITO is highly correlated with TIPO_DELITO_ARTICULO and 2 other fields | High correlation |
ZONA is highly correlated with COMUNA | High correlation |
COMUNA is highly correlated with ESTACION_POLICIA_CERCANA and 1 other fields | High correlation |
TIPO_LESION is highly correlated with TIPO_DELITO_ARTICULO and 2 other fields | High correlation |
GENERO_VICTIMA is highly correlated with TIPO_DELITO and 2 other fields | High correlation |
ESTADO_CIVIL_VICTIMA is highly correlated with GENERO_VICTIMA | High correlation |
GRUPO_ETARIO_VICTIMA is highly correlated with GENERO_VICTIMA | High correlation |
LATITUD has 4121 (3.3%) missing values | Missing |
LONGITUD has 4121 (3.3%) missing values | Missing |
DISTANCIA_ESTACION_POLICIA_CERCANA has 4121 (3.3%) missing values | Missing |
DISTANCIA_ESTACION_POLICIA_CERCANA is highly skewed (γ1 = 214.9790344) | Skewed |
CRIMEN_ID is uniformly distributed | Uniform |
CRIMEN_ID has unique values | Unique |
GRUPO_ETARIO_VICTIMA_num has 8152 (6.5%) zeros | Zeros |
Reproduction
| Analysis started | 2021-09-03 15:44:03.529631 |
|---|---|
| Analysis finished | 2021-09-03 16:08:22.350595 |
| Duration | 24 minutes and 18.82 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
CRIMEN_ID
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIFORMUNIQUE| Distinct | 124663 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 62332 |
| Minimum | 1 |
|---|---|
| Maximum | 124663 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 974.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6234.1 |
| Q1 | 31166.5 |
| median | 62332 |
| Q3 | 93497.5 |
| 95-th percentile | 118429.9 |
| Maximum | 124663 |
| Range | 124662 |
| Interquartile range (IQR) | 62331 |
Descriptive statistics
| Standard deviation | 35987.25264 |
|---|---|
| Coefficient of variation (CV) | 0.5773479536 |
| Kurtosis | -1.2 |
| Mean | 62332 |
| Median Absolute Deviation (MAD) | 31166 |
| Skewness | -2.203112978 × 10-17 |
| Sum | 7770494116 |
| Variance | 1295082353 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 83073 | 1 | < 0.1% |
| 83102 | 1 | < 0.1% |
| 83101 | 1 | < 0.1% |
| 83100 | 1 | < 0.1% |
| 83099 | 1 | < 0.1% |
| 83098 | 1 | < 0.1% |
| 83097 | 1 | < 0.1% |
| 83096 | 1 | < 0.1% |
| 83095 | 1 | < 0.1% |
| Other values (124653) | 124653 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 124663 | 1 | |
| 124662 | 1 | |
| 124661 | 1 | |
| 124660 | 1 | |
| 124659 | 1 | |
| 124658 | 1 | |
| 124657 | 1 | |
| 124656 | 1 | |
| 124655 | 1 | |
| 124654 | 1 |
FECHA
Date
| Distinct | 4072 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 974.1 KiB |
| Minimum | 2010-01-01 00:00:00 |
|---|---|
| Maximum | 2021-02-28 00:00:00 |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2015.287094 |
| Minimum | 2010 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 974.1 KiB |
Quantile statistics
| Minimum | 2010 |
|---|---|
| 5-th percentile | 2010 |
| Q1 | 2013 |
| median | 2016 |
| Q3 | 2018 |
| 95-th percentile | 2020 |
| Maximum | 2021 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.102117373 |
|---|---|
| Coefficient of variation (CV) | 0.001539293028 |
| Kurtosis | -1.14941778 |
| Mean | 2015.287094 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.1054762854 |
| Sum | 251231735 |
| Variance | 9.623132194 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2016 | 13600 | |
| 2018 | 13489 | |
| 2019 | 12509 | |
| 2017 | 12298 | |
| 2012 | 10912 | |
| 2015 | 10825 | |
| 2013 | 10759 | |
| 2011 | 10363 | |
| 2014 | 10214 | |
| 2020 | 9552 | |
| Other values (2) | 10142 |
| Value | Count | Frequency (%) |
| 2010 | 8691 | |
| 2011 | 10363 | |
| 2012 | 10912 | |
| 2013 | 10759 | |
| 2014 | 10214 | |
| 2015 | 10825 | |
| 2016 | 13600 | |
| 2017 | 12298 | |
| 2018 | 13489 | |
| 2019 | 12509 |
| Value | Count | Frequency (%) |
| 2021 | 1451 | 1.2% |
| 2020 | 9552 | |
| 2019 | 12509 | |
| 2018 | 13489 | |
| 2017 | 12298 | |
| 2016 | 13600 | |
| 2015 | 10825 | |
| 2014 | 10214 | |
| 2013 | 10759 | |
| 2012 | 10912 |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 122.3 KiB |
| ENERO | |
|---|---|
| DICIEMBRE | |
| OCTUBRE | |
| FEBRERO | |
| SEPTIEMBRE | |
| Other values (7) |
Length
| Max length | 10 |
|---|---|
| Median length | 6 |
| Mean length | 6.460449372 |
| Min length | 4 |
Characters and Unicode
| Total characters | 805379 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ENERO |
|---|---|
| 2nd row | ENERO |
| 3rd row | ENERO |
| 4th row | ENERO |
| 5th row | ENERO |
Common Values
| Value | Count | Frequency (%) |
| ENERO | 11275 | |
| DICIEMBRE | 11234 | |
| OCTUBRE | 11039 | |
| FEBRERO | 10996 | |
| SEPTIEMBRE | 10524 | |
| AGOSTO | 10293 | |
| JULIO | 10110 | |
| NOVIEMBRE | 10045 | |
| MAYO | 10035 | |
| MARZO | 9933 | |
| Other values (2) | 19179 |
Length
| Value | Count | Frequency (%) |
| enero | 11275 | |
| diciembre | 11234 | |
| octubre | 11039 | |
| febrero | 10996 | |
| septiembre | 10524 | |
| agosto | 10293 | |
| julio | 10110 | |
| noviembre | 10045 | |
| mayo | 10035 | |
| marzo | 9933 | |
| Other values (2) | 19179 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 129711 | |
| O | 103629 | |
| R | 95611 | |
| I | 72326 | |
| B | 63407 | |
| M | 51771 | 6.4% |
| A | 39830 | 4.9% |
| T | 31856 | 4.0% |
| N | 30930 | 3.8% |
| U | 30759 | 3.8% |
| Other values (11) | 155549 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 805379 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 129711 | |
| O | 103629 | |
| R | 95611 | |
| I | 72326 | |
| B | 63407 | |
| M | 51771 | 6.4% |
| A | 39830 | 4.9% |
| T | 31856 | 4.0% |
| N | 30930 | 3.8% |
| U | 30759 | 3.8% |
| Other values (11) | 155549 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 805379 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 129711 | |
| O | 103629 | |
| R | 95611 | |
| I | 72326 | |
| B | 63407 | |
| M | 51771 | 6.4% |
| A | 39830 | 4.9% |
| T | 31856 | 4.0% |
| N | 30930 | 3.8% |
| U | 30759 | 3.8% |
| Other values (11) | 155549 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 805379 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 129711 | |
| O | 103629 | |
| R | 95611 | |
| I | 72326 | |
| B | 63407 | |
| M | 51771 | 6.4% |
| A | 39830 | 4.9% |
| T | 31856 | 4.0% |
| N | 30930 | 3.8% |
| U | 30759 | 3.8% |
| Other values (11) | 155549 |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.519175698 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 974.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 3.514841219 |
|---|---|
| Coefficient of variation (CV) | 0.53915424 |
| Kurtosis | -1.251517787 |
| Mean | 6.519175698 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.02194759484 |
| Sum | 812700 |
| Variance | 12.35410879 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 11275 | |
| 12 | 11234 | |
| 10 | 11039 | |
| 2 | 10996 | |
| 9 | 10524 | |
| 8 | 10293 | |
| 7 | 10110 | |
| 11 | 10045 | |
| 5 | 10035 | |
| 3 | 9933 | |
| Other values (2) | 19179 |
| Value | Count | Frequency (%) |
| 1 | 11275 | |
| 2 | 10996 | |
| 3 | 9933 | |
| 4 | 9569 | |
| 5 | 10035 | |
| 6 | 9610 | |
| 7 | 10110 | |
| 8 | 10293 | |
| 9 | 10524 | |
| 10 | 11039 |
| Value | Count | Frequency (%) |
| 12 | 11234 | |
| 11 | 10045 | |
| 10 | 11039 | |
| 9 | 10524 | |
| 8 | 10293 | |
| 7 | 10110 | |
| 6 | 9610 | |
| 5 | 10035 | |
| 4 | 9569 | |
| 3 | 9933 |
DIA
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.45598935 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 974.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 15 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.825933381 |
|---|---|
| Coefficient of variation (CV) | 0.571036456 |
| Kurtosis | -1.19162181 |
| Mean | 15.45598935 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.03446042033 |
| Sum | 1926790 |
| Variance | 77.89710004 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 4813 | 3.9% |
| 10 | 4346 | 3.5% |
| 2 | 4297 | 3.4% |
| 5 | 4231 | 3.4% |
| 4 | 4194 | 3.4% |
| 3 | 4188 | 3.4% |
| 9 | 4184 | 3.4% |
| 16 | 4166 | 3.3% |
| 23 | 4153 | 3.3% |
| 22 | 4143 | 3.3% |
| Other values (21) | 81948 |
| Value | Count | Frequency (%) |
| 1 | 4813 | |
| 2 | 4297 | |
| 3 | 4188 | |
| 4 | 4194 | |
| 5 | 4231 | |
| 6 | 4047 | |
| 7 | 4116 | |
| 8 | 4114 | |
| 9 | 4184 | |
| 10 | 4346 |
| Value | Count | Frequency (%) |
| 31 | 2306 | |
| 30 | 3667 | |
| 29 | 3448 | |
| 28 | 3980 | |
| 27 | 4034 | |
| 26 | 3827 | |
| 25 | 3963 | |
| 24 | 3983 | |
| 23 | 4153 | |
| 22 | 4143 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 122.2 KiB |
| SÁBADO | |
|---|---|
| VIERNES | |
| MIÉRCOLES | |
| MARTES | |
| DOMINGO | |
| Other values (2) |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 6.568637045 |
| Min length | 5 |
Characters and Unicode
| Total characters | 818866 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | VIERNES |
|---|---|
| 2nd row | VIERNES |
| 3rd row | VIERNES |
| 4th row | VIERNES |
| 5th row | VIERNES |
Common Values
| Value | Count | Frequency (%) |
| SÁBADO | 20038 | |
| VIERNES | 17866 | |
| MIÉRCOLES | 17617 | |
| MARTES | 17460 | |
| DOMINGO | 17418 | |
| LUNES | 17247 | |
| JUEVES | 17017 |
Length
Pie chart
| Value | Count | Frequency (%) |
| sábado | 20038 | |
| viernes | 17866 | |
| miércoles | 17617 | |
| martes | 17460 | |
| domingo | 17418 | |
| lunes | 17247 | |
| jueves | 17017 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 122090 | |
| S | 107245 | |
| O | 72491 | 8.9% |
| R | 52943 | 6.5% |
| I | 52901 | 6.5% |
| N | 52531 | 6.4% |
| M | 52495 | 6.4% |
| A | 37498 | 4.6% |
| D | 37456 | 4.6% |
| V | 34883 | 4.3% |
| Other values (9) | 196333 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 818866 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 122090 | |
| S | 107245 | |
| O | 72491 | 8.9% |
| R | 52943 | 6.5% |
| I | 52901 | 6.5% |
| N | 52531 | 6.4% |
| M | 52495 | 6.4% |
| A | 37498 | 4.6% |
| D | 37456 | 4.6% |
| V | 34883 | 4.3% |
| Other values (9) | 196333 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 818866 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 122090 | |
| S | 107245 | |
| O | 72491 | 8.9% |
| R | 52943 | 6.5% |
| I | 52901 | 6.5% |
| N | 52531 | 6.4% |
| M | 52495 | 6.4% |
| A | 37498 | 4.6% |
| D | 37456 | 4.6% |
| V | 34883 | 4.3% |
| Other values (9) | 196333 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 781211 | |
| Latin 1 Sup | 37655 | 4.6% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 122090 | |
| S | 107245 | |
| O | 72491 | |
| R | 52943 | 6.8% |
| I | 52901 | 6.8% |
| N | 52531 | 6.7% |
| M | 52495 | 6.7% |
| A | 37498 | 4.8% |
| D | 37456 | 4.8% |
| V | 34883 | 4.5% |
| Other values (7) | 158678 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| Á | 20038 | |
| É | 17617 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.047471984 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 974.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.997051983 |
|---|---|
| Coefficient of variation (CV) | 0.4934072406 |
| Kurtosis | -1.262496204 |
| Mean | 4.047471984 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.04565478858 |
| Sum | 504570 |
| Variance | 3.988216624 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 20038 | |
| 5 | 17866 | |
| 3 | 17617 | |
| 2 | 17460 | |
| 7 | 17418 | |
| 1 | 17247 | |
| 4 | 17017 |
| Value | Count | Frequency (%) |
| 1 | 17247 | |
| 2 | 17460 | |
| 3 | 17617 | |
| 4 | 17017 | |
| 5 | 17866 | |
| 6 | 20038 | |
| 7 | 17418 |
| Value | Count | Frequency (%) |
| 7 | 17418 | |
| 6 | 20038 | |
| 5 | 17866 | |
| 4 | 17017 | |
| 3 | 17617 | |
| 2 | 17460 | |
| 1 | 17247 |
| Distinct | 56276 |
|---|---|
| Distinct (%) | 46.7% |
| Missing | 4121 |
| Missing (%) | 3.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.124001014 |
| Minimum | 4.5901267 |
|---|---|
| Maximum | 7.239192 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 974.1 KiB |
Quantile statistics
| Minimum | 4.5901267 |
|---|---|
| 5-th percentile | 7.0869384 |
| Q1 | 7.1089894 |
| median | 7.1214947 |
| Q3 | 7.138172825 |
| 95-th percentile | 7.1705574 |
| Maximum | 7.239192 |
| Range | 2.6490653 |
| Interquartile range (IQR) | 0.029183425 |
Descriptive statistics
| Standard deviation | 0.02448501873 |
|---|---|
| Coefficient of variation (CV) | 0.003436975751 |
| Kurtosis | 951.039271 |
| Mean | 7.124001014 |
| Median Absolute Deviation (MAD) | 0.0147122 |
| Skewness | -8.838809403 |
| Sum | 858741.3302 |
| Variance | 0.0005995161424 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.1780457 | 1799 | 1.4% |
| 7.1705574 | 1375 | 1.1% |
| 7.144371 | 1163 | 0.9% |
| 7.1646239 | 1073 | 0.9% |
| 7.1206454 | 1004 | 0.8% |
| 7.1389313 | 842 | 0.7% |
| 7.1217487 | 804 | 0.6% |
| 7.1345208 | 768 | 0.6% |
| 7.1481037 | 761 | 0.6% |
| 7.1218431 | 703 | 0.6% |
| Other values (56266) | 110250 | |
| (Missing) | 4121 | 3.3% |
| Value | Count | Frequency (%) |
| 4.5901267 | 1 | |
| 7.0026115 | 1 | |
| 7.0583626 | 1 | |
| 7.060732 | 1 | |
| 7.0608066 | 1 | |
| 7.0653126 | 1 | |
| 7.0685226 | 1 | |
| 7.0708911 | 1 | |
| 7.0709641 | 1 | |
| 7.0710819 | 1 |
| Value | Count | Frequency (%) |
| 7.239192 | 1 | < 0.1% |
| 7.2282032 | 1 | < 0.1% |
| 7.22811 | 1 | < 0.1% |
| 7.212821 | 1 | < 0.1% |
| 7.2123392 | 6 | < 0.1% |
| 7.2109843 | 1 | < 0.1% |
| 7.2107413 | 1 | < 0.1% |
| 7.2103252 | 1 | < 0.1% |
| 7.2099807 | 43 | |
| 7.2092311 | 1 | < 0.1% |
| Distinct | 49226 |
|---|---|
| Distinct (%) | 40.8% |
| Missing | 4121 |
| Missing (%) | 3.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.12425775 |
| Minimum | -74.1939044 |
|---|---|
| Maximum | -73.0546927 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 120542 |
| Negative (%) | 96.7% |
| Memory size | 974.1 KiB |
Quantile statistics
| Minimum | -74.1939044 |
|---|---|
| 5-th percentile | -73.14082138 |
| Q1 | -73.1317162 |
| median | -73.12468825 |
| Q3 | -73.1160843 |
| 95-th percentile | -73.10690871 |
| Maximum | -73.0546927 |
| Range | 1.1392117 |
| Interquartile range (IQR) | 0.0156319 |
Descriptive statistics
| Standard deviation | 0.01151763896 |
|---|---|
| Coefficient of variation (CV) | -0.0001575077726 |
| Kurtosis | 617.3517124 |
| Mean | -73.12425775 |
| Median Absolute Deviation (MAD) | 0.00781565 |
| Skewness | -6.816839956 |
| Sum | -8814544.277 |
| Variance | 0.0001326560073 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -73.1302244 | 1799 | 1.4% |
| -73.135108 | 1375 | 1.1% |
| -73.128085 | 1163 | 0.9% |
| -73.1391405 | 1081 | 0.9% |
| -73.12605 | 1004 | 0.8% |
| -73.1202363 | 848 | 0.7% |
| -73.118304 | 805 | 0.6% |
| -73.11588 | 768 | 0.6% |
| -73.1263899 | 761 | 0.6% |
| -73.139995 | 703 | 0.6% |
| Other values (49216) | 110235 | |
| (Missing) | 4121 | 3.3% |
| Value | Count | Frequency (%) |
| -74.1939044 | 1 | |
| -73.1761256 | 1 | |
| -73.1720572 | 1 | |
| -73.1719437 | 1 | |
| -73.171907 | 1 | |
| -73.1718962 | 1 | |
| -73.1718849 | 1 | |
| -73.1718519 | 1 | |
| -73.171842 | 1 | |
| -73.1717991 | 2 |
| Value | Count | Frequency (%) |
| -73.0546927 | 1 | |
| -73.0548645 | 1 | |
| -73.0577685 | 1 | |
| -73.0581339 | 1 | |
| -73.0636927 | 1 | |
| -73.0637785 | 1 | |
| -73.0638215 | 1 | |
| -73.06445 | 1 | |
| -73.0705792 | 1 | |
| -73.0729062 | 1 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 122.0 KiB |
| URBANA | |
|---|---|
| RURAL | 1058 |
| OTRA | 42 |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.990839303 |
| Min length | 4 |
Characters and Unicode
| Total characters | 746836 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | URBANA |
|---|---|
| 2nd row | URBANA |
| 3rd row | URBANA |
| 4th row | URBANA |
| 5th row | URBANA |
Common Values
| Value | Count | Frequency (%) |
| URBANA | 123563 | |
| RURAL | 1058 | 0.8% |
| OTRA | 42 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| urbana | 123563 | |
| rural | 1058 | 0.8% |
| otra | 42 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 248226 | |
| R | 125721 | |
| U | 124621 | |
| B | 123563 | |
| N | 123563 | |
| L | 1058 | 0.1% |
| O | 42 | < 0.1% |
| T | 42 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 746836 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 248226 | |
| R | 125721 | |
| U | 124621 | |
| B | 123563 | |
| N | 123563 | |
| L | 1058 | 0.1% |
| O | 42 | < 0.1% |
| T | 42 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 746836 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 248226 | |
| R | 125721 | |
| U | 124621 | |
| B | 123563 | |
| N | 123563 | |
| L | 1058 | 0.1% |
| O | 42 | < 0.1% |
| T | 42 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 746836 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 248226 | |
| R | 125721 | |
| U | 124621 | |
| B | 123563 | |
| N | 123563 | |
| L | 1058 | 0.1% |
| O | 42 | < 0.1% |
| T | 42 | < 0.1% |
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 122.6 KiB |
| SAN FRANCISCO | |
|---|---|
| CENTRO | |
| ORIENTAL | |
| NORTE | |
| CABECERA DEL LLANO | |
| Other values (16) |
Length
| Max length | 19 |
|---|---|
| Median length | 13 |
| Mean length | 11.3374618 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1413362 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MORRORICO |
|---|---|
| 2nd row | GARCÍA ROVIRA |
| 3rd row | GARCÍA ROVIRA |
| 4th row | SAN FRANCISCO |
| 5th row | OCCIDENTAL |
Common Values
| Value | Count | Frequency (%) |
| SAN FRANCISCO | 14717 | |
| CENTRO | 12665 | |
| ORIENTAL | 12265 | |
| NORTE | 11797 | |
| CABECERA DEL LLANO | 11747 | |
| LA CONCORDIA | 9209 | 7.4% |
| GARCÍA ROVIRA | 8473 | 6.8% |
| OCCIDENTAL | 7194 | 5.8% |
| PROVENZA | 6068 | 4.9% |
| LA PEDREGOSA | 4770 | 3.8% |
| Other values (11) | 25758 |
Length
| Value | Count | Frequency (%) |
| la | 16965 | 8.4% |
| oriental | 16623 | 8.2% |
| del | 14731 | 7.3% |
| san | 14717 | 7.3% |
| francisco | 14717 | 7.3% |
| centro | 12665 | 6.2% |
| norte | 11797 | 5.8% |
| llano | 11747 | 5.8% |
| cabecera | 11747 | 5.8% |
| concordia | 9209 | 4.5% |
| Other values (19) | 67976 |
Most occurring characters
| Value | Count | Frequency (%) |
| 201859 | ||
| A | 162908 | |
| R | 135883 | |
| O | 134114 | |
| C | 126261 | |
| E | 116549 | |
| N | 113351 | |
| L | 84977 | |
| I | 74351 | 5.3% |
| T | 56496 | 4.0% |
| Other values (15) | 206613 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1210513 | |
| Space Separator | 201859 | 14.3% |
| Decimal Number | 990 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 162908 | |
| R | 135883 | |
| O | 134114 | |
| C | 126261 | |
| E | 116549 | |
| N | 113351 | |
| L | 84977 | |
| I | 74351 | |
| T | 56496 | 4.7% |
| S | 48580 | 4.0% |
| Other values (11) | 157043 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 551 | |
| 3 | 362 | |
| 2 | 77 | 7.8% |
Space Separator
| Value | Count | Frequency (%) |
| 201859 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1210513 | |
| Common | 202849 | 14.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 162908 | |
| R | 135883 | |
| O | 134114 | |
| C | 126261 | |
| E | 116549 | |
| N | 113351 | |
| L | 84977 | |
| I | 74351 | |
| T | 56496 | 4.7% |
| S | 48580 | 4.0% |
| Other values (11) | 157043 |
Common
| Value | Count | Frequency (%) |
| 201859 | ||
| 1 | 551 | 0.3% |
| 3 | 362 | 0.2% |
| 2 | 77 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1404889 | |
| Latin 1 Sup | 8473 | 0.6% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 201859 | ||
| A | 162908 | |
| R | 135883 | |
| O | 134114 | |
| C | 126261 | |
| E | 116549 | |
| N | 113351 | |
| L | 84977 | |
| I | 74351 | 5.3% |
| T | 56496 | 4.0% |
| Other values (14) | 198140 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| Í | 8473 |
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.303634599 |
| Minimum | 0 |
|---|---|
| Maximum | 17 |
| Zeros | 1035 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 974.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 8 |
| Q3 | 13 |
| 95-th percentile | 16 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 5.059222217 |
|---|---|
| Coefficient of variation (CV) | 0.6092780404 |
| Kurtosis | -1.396679262 |
| Mean | 8.303634599 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.05122617179 |
| Sum | 1035156 |
| Variance | 25.59572944 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 14717 | |
| 15 | 12665 | |
| 13 | 12265 | |
| 1 | 11797 | |
| 12 | 11747 | |
| 6 | 9209 | 7.4% |
| 5 | 8473 | 6.8% |
| 4 | 7194 | 5.8% |
| 10 | 6068 | 4.9% |
| 9 | 4770 | 3.8% |
| Other values (8) | 25758 |
| Value | Count | Frequency (%) |
| 0 | 1035 | 0.8% |
| 1 | 11797 | |
| 2 | 4358 | 3.5% |
| 3 | 14717 | |
| 4 | 7194 | |
| 5 | 8473 | |
| 6 | 9209 | |
| 7 | 2986 | 2.4% |
| 8 | 3221 | 2.6% |
| 9 | 4770 | 3.8% |
| Value | Count | Frequency (%) |
| 17 | 3961 | 3.2% |
| 16 | 2984 | 2.4% |
| 15 | 12665 | |
| 14 | 3003 | 2.4% |
| 13 | 12265 | |
| 12 | 11747 | |
| 11 | 4210 | 3.4% |
| 10 | 6068 | |
| 9 | 4770 | 3.8% |
| 8 | 3221 | 2.6% |
| Distinct | 408 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 263.0 KiB |
| CENTRO | |
|---|---|
| SAN FRANCISCO | 5452 |
| CABECERA DEL LLANO | 5243 |
| LA CONCORDIA | 4670 |
| PROVENZA | 3164 |
| Other values (403) |
Length
| Max length | 32 |
|---|---|
| Median length | 11 |
| Mean length | 11.53181778 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1437591 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 41 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | BUENOS AIRES |
|---|---|
| 2nd row | CAMPO HERMOSO |
| 3rd row | CAMPO HERMOSO |
| 4th row | COMUNEROS |
| 5th row | GIRARDOT |
Common Values
| Value | Count | Frequency (%) |
| CENTRO | 10197 | 8.2% |
| SAN FRANCISCO | 5452 | 4.4% |
| CABECERA DEL LLANO | 5243 | 4.2% |
| LA CONCORDIA | 4670 | 3.7% |
| PROVENZA | 3164 | 2.5% |
| SAN ALONSO | 3126 | 2.5% |
| CAMPO HERMOSO | 2624 | 2.1% |
| SOTOMAYOR | 2469 | 2.0% |
| GARCIA ROVIRA | 2469 | 2.0% |
| GIRARDOT | 2444 | 2.0% |
| Other values (398) | 82805 |
Length
| Value | Count | Frequency (%) |
| la | 14796 | 6.3% |
| san | 12289 | 5.2% |
| centro | 10197 | 4.3% |
| del | 9049 | 3.8% |
| de | 7233 | 3.1% |
| francisco | 5453 | 2.3% |
| cabecera | 5243 | 2.2% |
| llano | 5243 | 2.2% |
| concordia | 4670 | 2.0% |
| el | 4660 | 2.0% |
| Other values (402) | 157410 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 203452 | |
| O | 135855 | |
| R | 123013 | 8.6% |
| 111599 | 7.8% | |
| E | 108597 | 7.6% |
| N | 107494 | 7.5% |
| L | 86039 | 6.0% |
| I | 85251 | 5.9% |
| C | 79663 | 5.5% |
| S | 78896 | 5.5% |
| Other values (29) | 317732 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1317684 | |
| Space Separator | 111599 | 7.8% |
| Other Punctuation | 6234 | 0.4% |
| Decimal Number | 831 | 0.1% |
| Open Punctuation | 612 | < 0.1% |
| Close Punctuation | 612 | < 0.1% |
| Dash Punctuation | 19 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 203452 | |
| O | 135855 | |
| R | 123013 | |
| E | 108597 | |
| N | 107494 | |
| L | 86039 | 6.5% |
| I | 85251 | 6.5% |
| C | 79663 | 6.0% |
| S | 78896 | 6.0% |
| D | 58400 | 4.4% |
| Other values (17) | 251024 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 361 | |
| 1 | 295 | |
| 3 | 110 | 13.2% |
| 0 | 33 | 4.0% |
| 4 | 18 | 2.2% |
| 5 | 14 | 1.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6221 | |
| / | 13 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 111599 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 612 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 612 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1317684 | |
| Common | 119907 | 8.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 203452 | |
| O | 135855 | |
| R | 123013 | |
| E | 108597 | |
| N | 107494 | |
| L | 86039 | 6.5% |
| I | 85251 | 6.5% |
| C | 79663 | 6.0% |
| S | 78896 | 6.0% |
| D | 58400 | 4.4% |
| Other values (17) | 251024 |
Common
| Value | Count | Frequency (%) |
| 111599 | ||
| . | 6221 | 5.2% |
| ( | 612 | 0.5% |
| ) | 612 | 0.5% |
| 2 | 361 | 0.3% |
| 1 | 295 | 0.2% |
| 3 | 110 | 0.1% |
| 0 | 33 | < 0.1% |
| - | 19 | < 0.1% |
| 4 | 18 | < 0.1% |
| Other values (2) | 27 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1436346 | |
| Latin 1 Sup | 1245 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 203452 | |
| O | 135855 | |
| R | 123013 | 8.6% |
| 111599 | 7.8% | |
| E | 108597 | 7.6% |
| N | 107494 | 7.5% |
| L | 86039 | 6.0% |
| I | 85251 | 5.9% |
| C | 79663 | 5.5% |
| S | 78896 | 5.5% |
| Other values (27) | 316487 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| Ñ | 1189 | |
| Í | 56 | 4.5% |
| Distinct | 278 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.9 KiB |
| CENTRO | 7525 |
|---|---|
| NO REPORTA | 5343 |
| SAN FRANCISCO | 4288 |
| LA CONCORDIA | 4154 |
| CABECERA DEL LLANO | 3788 |
| Other values (273) |
Length
| Max length | 30 |
|---|---|
| Median length | 11 |
| Mean length | 11.55037982 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1439905 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | CORREGIMIENTO 1 |
|---|---|
| 2nd row | CENTRO |
| 3rd row | CENTRO |
| 4th row | VILLAS DE SAN IGNACIO |
| 5th row | CORREGIMIENTO 1 |
Common Values
| Value | Count | Frequency (%) |
| CENTRO | 7525 | 6.0% |
| NO REPORTA | 5343 | 4.3% |
| SAN FRANCISCO | 4288 | 3.4% |
| LA CONCORDIA | 4154 | 3.3% |
| CABECERA DEL LLANO | 3788 | 3.0% |
| CAFE MADRID | 3072 | 2.5% |
| UNIVERSIDAD | 2977 | 2.4% |
| REAL DE MINAS | 2952 | 2.4% |
| ANTONIA SANTOS CENTRO | 2933 | 2.4% |
| GARCIA ROVIRA | 2787 | 2.2% |
| Other values (268) | 84844 |
Length
| Value | Count | Frequency (%) |
| la | 12478 | 5.4% |
| san | 11214 | 4.8% |
| centro | 10458 | 4.5% |
| comuna | 8726 | 3.7% |
| de | 6959 | 3.0% |
| del | 6046 | 2.6% |
| no | 5343 | 2.3% |
| reporta | 5343 | 2.3% |
| llano | 4371 | 1.9% |
| cabecera | 4371 | 1.9% |
| Other values (289) | 157917 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 194377 | |
| O | 141390 | |
| R | 120797 | 8.4% |
| N | 117400 | 8.2% |
| E | 113870 | 7.9% |
| 108563 | 7.5% | |
| I | 93655 | 6.5% |
| C | 83111 | 5.8% |
| S | 74575 | 5.2% |
| L | 73110 | 5.1% |
| Other values (31) | 319057 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1327813 | |
| Space Separator | 108563 | 7.5% |
| Decimal Number | 3384 | 0.2% |
| Other Punctuation | 64 | < 0.1% |
| Open Punctuation | 40 | < 0.1% |
| Close Punctuation | 40 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 194377 | |
| O | 141390 | |
| R | 120797 | |
| N | 117400 | |
| E | 113870 | |
| I | 93655 | 7.1% |
| C | 83111 | 6.3% |
| S | 74575 | 5.6% |
| L | 73110 | 5.5% |
| T | 55721 | 4.2% |
| Other values (20) | 259807 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2398 | |
| 3 | 747 | 22.1% |
| 2 | 221 | 6.5% |
| 0 | 13 | 0.4% |
| 5 | 4 | 0.1% |
| 7 | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 108563 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 40 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 40 |
Other Punctuation
| Value | Count | Frequency (%) |
| * | 64 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1327813 | |
| Common | 112092 | 7.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 194377 | |
| O | 141390 | |
| R | 120797 | |
| N | 117400 | |
| E | 113870 | |
| I | 93655 | 7.1% |
| C | 83111 | 6.3% |
| S | 74575 | 5.6% |
| L | 73110 | 5.5% |
| T | 55721 | 4.2% |
| Other values (20) | 259807 |
Common
| Value | Count | Frequency (%) |
| 108563 | ||
| 1 | 2398 | 2.1% |
| 3 | 747 | 0.7% |
| 2 | 221 | 0.2% |
| * | 64 | 0.1% |
| ( | 40 | < 0.1% |
| ) | 40 | < 0.1% |
| 0 | 13 | < 0.1% |
| 5 | 4 | < 0.1% |
| 7 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1438806 | |
| Latin 1 Sup | 1099 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 194377 | |
| O | 141390 | |
| R | 120797 | 8.4% |
| N | 117400 | 8.2% |
| E | 113870 | 7.9% |
| 108563 | 7.5% | |
| I | 93655 | 6.5% |
| C | 83111 | 5.8% |
| S | 74575 | 5.2% |
| L | 73110 | 5.1% |
| Other values (26) | 317958 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| Ñ | 592 | |
| Á | 217 | 19.7% |
| Ó | 144 | 13.1% |
| É | 137 | 12.5% |
| Ì | 9 | 0.8% |
| Distinct | 29 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 123.1 KiB |
| ARTÍCULO 239 | |
|---|---|
| ARTÍCULO 111 | |
| ARTÍCULO 120 | |
| ARTÍCULO 229 | |
| ARTÍCULO 209 | 1518 |
| Other values (24) | 4777 |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 12.00598413 |
| Min length | 10 |
Characters and Unicode
| Total characters | 1496702 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ARTÍCULO 111 |
|---|---|
| 2nd row | ARTÍCULO 111 |
| 3rd row | ARTÍCULO 111 |
| 4th row | ARTÍCULO 111 |
| 5th row | ARTÍCULO 111 |
Common Values
| Value | Count | Frequency (%) |
| ARTÍCULO 239 | 58983 | |
| ARTÍCULO 111 | 22430 | 18.0% |
| ARTÍCULO 120 | 20704 | 16.6% |
| ARTÍCULO 229 | 16251 | 13.0% |
| ARTÍCULO 209 | 1518 | 1.2% |
| ARTÍCULO 103 | 1238 | 1.0% |
| ARTÍCULO 208 | 793 | 0.6% |
| ARTÍCULO 109 | 568 | 0.5% |
| ARTÍCULO 205 | 458 | 0.4% |
| ARTÍCULO 244 | 432 | 0.3% |
| Other values (19) | 1288 | 1.0% |
Length
| Value | Count | Frequency (%) |
| artículo | 124651 | |
| 239 | 58983 | |
| 111 | 22430 | 9.0% |
| 120 | 20704 | 8.3% |
| 229 | 16251 | 6.5% |
| 209 | 1518 | 0.6% |
| 103 | 1238 | 0.5% |
| 208 | 793 | 0.3% |
| 109 | 568 | 0.2% |
| 210 | 504 | 0.2% |
| Other values (19) | 2071 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 125048 | |
| 125048 | ||
| R | 124675 | |
| O | 124675 | |
| T | 124663 | |
| Í | 124651 | |
| C | 124651 | |
| U | 124651 | |
| L | 124651 | |
| 2 | 116622 | |
| Other values (12) | 257367 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 997701 | |
| Decimal Number | 373953 | 25.0% |
| Space Separator | 125048 | 8.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 125048 | |
| R | 124675 | |
| O | 124675 | |
| T | 124663 | |
| Í | 124651 | |
| C | 124651 | |
| U | 124651 | |
| L | 124651 | |
| N | 12 | < 0.1% |
| E | 12 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 116622 | |
| 1 | 90637 | |
| 9 | 77392 | |
| 3 | 60257 | |
| 0 | 26301 | 7.0% |
| 4 | 908 | 0.2% |
| 8 | 866 | 0.2% |
| 5 | 462 | 0.1% |
| 6 | 367 | 0.1% |
| 7 | 141 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 125048 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 997701 | |
| Common | 499001 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 125048 | |
| R | 124675 | |
| O | 124675 | |
| T | 124663 | |
| Í | 124651 | |
| C | 124651 | |
| U | 124651 | |
| L | 124651 | |
| N | 12 | < 0.1% |
| E | 12 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 125048 | ||
| 2 | 116622 | |
| 1 | 90637 | |
| 9 | 77392 | |
| 3 | 60257 | |
| 0 | 26301 | 5.3% |
| 4 | 908 | 0.2% |
| 8 | 866 | 0.2% |
| 5 | 462 | 0.1% |
| 6 | 367 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1372051 | |
| Latin 1 Sup | 124651 | 8.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 125048 | |
| 125048 | ||
| R | 124675 | |
| O | 124675 | |
| T | 124663 | |
| C | 124651 | |
| U | 124651 | |
| L | 124651 | |
| 2 | 116622 | |
| 1 | 90637 | |
| Other values (11) | 166730 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| Í | 124651 |
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 123.2 KiB |
| HURTO A PERSONAS | |
|---|---|
| LESIONES PERSONALES | |
| LESIONES CULPOSAS (EN ACCIDENTE DE TRANSITO) | |
| VIOLENCIA INTRAFAMILIAR | |
| HURTO A ENTIDADES COMERCIALES | |
| Other values (35) |
Length
| Max length | 95 |
|---|---|
| Median length | 19 |
| Mean length | 23.85142344 |
| Min length | 9 |
Characters and Unicode
| Total characters | 2973390 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | LESIONES PERSONALES |
|---|---|
| 2nd row | LESIONES PERSONALES |
| 3rd row | LESIONES PERSONALES |
| 4th row | LESIONES PERSONALES |
| 5th row | LESIONES PERSONALES |
Common Values
| Value | Count | Frequency (%) |
| HURTO A PERSONAS | 43204 | |
| LESIONES PERSONALES | 22430 | |
| LESIONES CULPOSAS (EN ACCIDENTE DE TRANSITO) | 20695 | |
| VIOLENCIA INTRAFAMILIAR | 16251 | 13.0% |
| HURTO A ENTIDADES COMERCIALES | 8089 | 6.5% |
| HURTO A RESIDENCIAS | 4336 | 3.5% |
| HURTO A MOTOCICLETAS | 3101 | 2.5% |
| ACTOS SEXUALES CON MENOR DE 14 AÑOS | 1518 | 1.2% |
| HOMICIDIO | 1238 | 1.0% |
| ACCESO CARNAL ABUSIVO CON MENOR DE 14 AÑOS | 793 | 0.6% |
| Other values (30) | 3008 | 2.4% |
Length
| Value | Count | Frequency (%) |
| a | 59033 | |
| hurto | 58983 | |
| personas | 43204 | |
| lesiones | 43139 | |
| de | 24240 | 5.8% |
| personales | 22431 | 5.4% |
| en | 21741 | 5.2% |
| accidente | 21263 | 5.1% |
| culposas | 20704 | 5.0% |
| transito | 20695 | 5.0% |
| Other values (65) | 82479 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 334651 | |
| S | 322239 | |
| 293249 | ||
| A | 292202 | |
| O | 261997 | |
| N | 227609 | |
| I | 205469 | 6.9% |
| R | 197334 | 6.6% |
| T | 158086 | 5.3% |
| L | 135813 | 4.6% |
| Other values (22) | 544741 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2632829 | |
| Space Separator | 293249 | 9.9% |
| Open Punctuation | 21304 | 0.7% |
| Close Punctuation | 21304 | 0.7% |
| Decimal Number | 4704 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 334651 | |
| S | 322239 | |
| A | 292202 | |
| O | 261997 | |
| N | 227609 | |
| I | 205469 | |
| R | 197334 | |
| T | 158086 | |
| L | 135813 | 5.2% |
| C | 120322 | 4.6% |
| Other values (16) | 377107 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2352 | |
| 4 | 2343 | |
| 8 | 9 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 293249 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 21304 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 21304 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2632829 | |
| Common | 340561 | 11.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 334651 | |
| S | 322239 | |
| A | 292202 | |
| O | 261997 | |
| N | 227609 | |
| I | 205469 | |
| R | 197334 | |
| T | 158086 | |
| L | 135813 | 5.2% |
| C | 120322 | 4.6% |
| Other values (16) | 377107 |
Common
| Value | Count | Frequency (%) |
| 293249 | ||
| ( | 21304 | 6.3% |
| ) | 21304 | 6.3% |
| 1 | 2352 | 0.7% |
| 4 | 2343 | 0.7% |
| 8 | 9 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2969616 | |
| Latin 1 Sup | 3774 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 334651 | |
| S | 322239 | |
| 293249 | ||
| A | 292202 | |
| O | 261997 | |
| N | 227609 | |
| I | 205469 | 6.9% |
| R | 197334 | 6.6% |
| T | 158086 | 5.3% |
| L | 135813 | 4.6% |
| Other values (18) | 540967 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| Ñ | 2359 | |
| Ó | 758 | 20.1% |
| Á | 568 | 15.1% |
| Í | 89 | 2.4% |
| Distinct | 111 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 126.8 KiB |
| HURTO A PERSONAS | |
|---|---|
| LESIONES PERSONALES | |
| LESIONES CULPOSAS (EN ACCIDENTE DE TRANSITO) | |
| VIOLENCIA INTRAFAMILIAR | |
| HURTO A ENTIDADES COMERCIALES | |
| Other values (106) |
Length
| Max length | 95 |
|---|---|
| Median length | 19 |
| Mean length | 23.18374337 |
| Min length | 5 |
Characters and Unicode
| Total characters | 2890155 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | LESIONES PERSONALES |
|---|---|
| 2nd row | LESIONES PERSONALES |
| 3rd row | LESIONES PERSONALES |
| 4th row | LESIONES PERSONALES |
| 5th row | LESIONES PERSONALES |
Common Values
| Value | Count | Frequency (%) |
| HURTO A PERSONAS | 39101 | |
| LESIONES PERSONALES | 21030 | |
| LESIONES CULPOSAS (EN ACCIDENTE DE TRANSITO) | 19360 | |
| VIOLENCIA INTRAFAMILIAR | 15953 | |
| HURTO A ENTIDADES COMERCIALES | 7766 | 6.2% |
| HURTO A RESIDENCIAS | 4105 | 3.3% |
| HURTO A MOTOCICLETAS | 2871 | 2.3% |
| ACTOS SEXUALES CON MENOR DE 14 AÑOS | 1436 | 1.2% |
| ATRACO | 1344 | 1.1% |
| RIÑAS | 1201 | 1.0% |
| Other values (101) | 10496 | 8.4% |
Length
| Value | Count | Frequency (%) |
| a | 54531 | |
| hurto | 54521 | |
| lesiones | 41259 | |
| personas | 39101 | |
| de | 23927 | 5.9% |
| personales | 21030 | 5.2% |
| en | 20983 | 5.2% |
| accidente | 20065 | 5.0% |
| transito | 19574 | 4.8% |
| culposas | 19387 | 4.8% |
| Other values (165) | 90524 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 322332 | |
| S | 304923 | |
| A | 289221 | |
| 280252 | ||
| O | 256070 | |
| N | 219479 | |
| I | 201593 | 7.0% |
| R | 193847 | 6.7% |
| T | 155009 | 5.4% |
| L | 133294 | 4.6% |
| Other values (26) | 534135 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2563716 | |
| Space Separator | 280252 | 9.7% |
| Open Punctuation | 19971 | 0.7% |
| Close Punctuation | 19891 | 0.7% |
| Decimal Number | 4450 | 0.2% |
| Other Punctuation | 1855 | 0.1% |
| Dash Punctuation | 20 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 322332 | |
| S | 304923 | |
| A | 289221 | |
| O | 256070 | |
| N | 219479 | |
| I | 201593 | |
| R | 193847 | |
| T | 155009 | |
| L | 133294 | 5.2% |
| C | 118200 | 4.6% |
| Other values (17) | 369748 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2225 | |
| 4 | 2221 | |
| 8 | 4 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1835 | |
| . | 20 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| 280252 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 19971 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 19891 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 20 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2563716 | |
| Common | 326439 | 11.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 322332 | |
| S | 304923 | |
| A | 289221 | |
| O | 256070 | |
| N | 219479 | |
| I | 201593 | |
| R | 193847 | |
| T | 155009 | |
| L | 133294 | 5.2% |
| C | 118200 | 4.6% |
| Other values (17) | 369748 |
Common
| Value | Count | Frequency (%) |
| 280252 | ||
| ( | 19971 | 6.1% |
| ) | 19891 | 6.1% |
| 1 | 2225 | 0.7% |
| 4 | 2221 | 0.7% |
| / | 1835 | 0.6% |
| . | 20 | < 0.1% |
| - | 20 | < 0.1% |
| 8 | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2885297 | |
| Latin 1 Sup | 4858 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 322332 | |
| S | 304923 | |
| A | 289221 | |
| 280252 | ||
| O | 256070 | |
| N | 219479 | |
| I | 201593 | 7.0% |
| R | 193847 | 6.7% |
| T | 155009 | 5.4% |
| L | 133294 | 4.6% |
| Other values (22) | 529277 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| Ñ | 3557 | |
| Ó | 735 | 15.1% |
| Á | 491 | 10.1% |
| Í | 75 | 1.5% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 122.1 KiB |
| LESIONES NO FATALES | |
|---|---|
| VIOLENCIA SEXUAL | 4155 |
| LESIONES FATALES | 1832 |
| NO REPORTA | 6 |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 18.8554904 |
| Min length | 10 |
Characters and Unicode
| Total characters | 2350582 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | LESIONES NO FATALES |
|---|---|
| 2nd row | LESIONES NO FATALES |
| 3rd row | LESIONES NO FATALES |
| 4th row | LESIONES NO FATALES |
| 5th row | LESIONES NO FATALES |
Common Values
| Value | Count | Frequency (%) |
| LESIONES NO FATALES | 118670 | |
| VIOLENCIA SEXUAL | 4155 | 3.3% |
| LESIONES FATALES | 1832 | 1.5% |
| NO REPORTA | 6 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| lesiones | 120502 | |
| fatales | 120502 | |
| no | 118676 | |
| violencia | 4155 | 1.1% |
| sexual | 4155 | 1.1% |
| reporta | 6 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 369822 | |
| S | 365661 | |
| A | 249320 | |
| L | 249314 | |
| O | 243339 | |
| N | 243333 | |
| 243333 | ||
| I | 128812 | 5.5% |
| T | 120508 | 5.1% |
| F | 120502 | 5.1% |
| Other values (6) | 16638 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2107249 | |
| Space Separator | 243333 | 10.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 369822 | |
| S | 365661 | |
| A | 249320 | |
| L | 249314 | |
| O | 243339 | |
| N | 243333 | |
| I | 128812 | 6.1% |
| T | 120508 | 5.7% |
| F | 120502 | 5.7% |
| V | 4155 | 0.2% |
| Other values (5) | 12483 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| 243333 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2107249 | |
| Common | 243333 | 10.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 369822 | |
| S | 365661 | |
| A | 249320 | |
| L | 249314 | |
| O | 243339 | |
| N | 243333 | |
| I | 128812 | 6.1% |
| T | 120508 | 5.7% |
| F | 120502 | 5.7% |
| V | 4155 | 0.2% |
| Other values (5) | 12483 | 0.6% |
Common
| Value | Count | Frequency (%) |
| 243333 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2350582 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 369822 | |
| S | 365661 | |
| A | 249320 | |
| L | 249314 | |
| O | 243339 | |
| N | 243333 | |
| 243333 | ||
| I | 128812 | 5.5% |
| T | 120508 | 5.1% |
| F | 120502 | 5.1% |
| Other values (6) | 16638 | 0.7% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 122.0 KiB |
| MASCULINO | |
|---|---|
| FEMENINO | |
| NO REPORTA |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.621274957 |
| Min length | 8 |
Characters and Unicode
| Total characters | 1074754 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MASCULINO |
|---|---|
| 2nd row | MASCULINO |
| 3rd row | MASCULINO |
| 4th row | MASCULINO |
| 5th row | MASCULINO |
Common Values
| Value | Count | Frequency (%) |
| MASCULINO | 61754 | |
| FEMENINO | 55061 | |
| NO REPORTA | 7848 | 6.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| masculino | 61754 | |
| femenino | 55061 | |
| no | 7848 | 5.9% |
| reporta | 7848 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 179724 | |
| O | 132511 | |
| E | 117970 | |
| M | 116815 | |
| I | 116815 | |
| A | 69602 | 6.5% |
| S | 61754 | 5.7% |
| C | 61754 | 5.7% |
| U | 61754 | 5.7% |
| L | 61754 | 5.7% |
| Other values (5) | 94301 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1066906 | |
| Space Separator | 7848 | 0.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 179724 | |
| O | 132511 | |
| E | 117970 | |
| M | 116815 | |
| I | 116815 | |
| A | 69602 | 6.5% |
| S | 61754 | 5.8% |
| C | 61754 | 5.8% |
| U | 61754 | 5.8% |
| L | 61754 | 5.8% |
| Other values (4) | 86453 |
Space Separator
| Value | Count | Frequency (%) |
| 7848 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1066906 | |
| Common | 7848 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 179724 | |
| O | 132511 | |
| E | 117970 | |
| M | 116815 | |
| I | 116815 | |
| A | 69602 | 6.5% |
| S | 61754 | 5.8% |
| C | 61754 | 5.8% |
| U | 61754 | 5.8% |
| L | 61754 | 5.8% |
| Other values (4) | 86453 |
Common
| Value | Count | Frequency (%) |
| 7848 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1074754 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 179724 | |
| O | 132511 | |
| E | 117970 | |
| M | 116815 | |
| I | 116815 | |
| A | 69602 | 6.5% |
| S | 61754 | 5.7% |
| C | 61754 | 5.7% |
| U | 61754 | 5.7% |
| L | 61754 | 5.7% |
| Other values (5) | 94301 |
| Distinct | 99 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.03731661 |
| Minimum | -1 |
|---|---|
| Maximum | 100 |
| Zeros | 28 |
| Zeros (%) | < 0.1% |
| Negative | 8709 |
| Negative (%) | 7.0% |
| Memory size | 974.1 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | 21 |
| median | 29 |
| Q3 | 41 |
| 95-th percentile | 61 |
| Maximum | 100 |
| Range | 101 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 16.65157659 |
|---|---|
| Coefficient of variation (CV) | 0.5365018116 |
| Kurtosis | 0.2951144199 |
| Mean | 31.03731661 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.31940018 |
| Sum | 3869205 |
| Variance | 277.2750028 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 8709 | 7.0% |
| 25 | 4210 | 3.4% |
| 23 | 4038 | 3.2% |
| 22 | 3883 | 3.1% |
| 26 | 3881 | 3.1% |
| 21 | 3854 | 3.1% |
| 24 | 3828 | 3.1% |
| 20 | 3720 | 3.0% |
| 27 | 3676 | 2.9% |
| 30 | 3603 | 2.9% |
| Other values (89) | 81261 |
| Value | Count | Frequency (%) |
| -1 | 8709 | |
| 0 | 28 | < 0.1% |
| 1 | 106 | 0.1% |
| 2 | 177 | 0.1% |
| 3 | 250 | 0.2% |
| 4 | 254 | 0.2% |
| 5 | 286 | 0.2% |
| 6 | 275 | 0.2% |
| 7 | 279 | 0.2% |
| 8 | 313 | 0.3% |
| Value | Count | Frequency (%) |
| 100 | 1 | < 0.1% |
| 98 | 1 | < 0.1% |
| 95 | 1 | < 0.1% |
| 94 | 5 | < 0.1% |
| 93 | 6 | < 0.1% |
| 92 | 12 | < 0.1% |
| 91 | 15 | |
| 90 | 22 | |
| 89 | 16 | |
| 88 | 32 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 122.2 KiB |
| ADULTEZ | |
|---|---|
| JOVENES | |
| NO REPORTA | |
| ADOLESCENCIA | |
| PERSONA MAYOR | |
| Other values (2) | 3126 |
Length
| Max length | 16 |
|---|---|
| Median length | 7 |
| Mean length | 7.942308464 |
| Min length | 7 |
Characters and Unicode
| Total characters | 990112 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ADULTEZ |
|---|---|
| 2nd row | JOVENES |
| 3rd row | JOVENES |
| 4th row | ADULTEZ |
| 5th row | JOVENES |
Common Values
| Value | Count | Frequency (%) |
| ADULTEZ | 58177 | |
| JOVENES | 40843 | |
| NO REPORTA | 8152 | 6.5% |
| ADOLESCENCIA | 7309 | 5.9% |
| PERSONA MAYOR | 7056 | 5.7% |
| INFANCIA | 1750 | 1.4% |
| PRIMERA INFANCIA | 1376 | 1.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| adultez | 58177 | |
| jovenes | 40843 | |
| no | 8152 | 5.8% |
| reporta | 8152 | 5.8% |
| adolescencia | 7309 | 5.2% |
| persona | 7056 | 5.0% |
| mayor | 7056 | 5.0% |
| infancia | 3126 | 2.2% |
| primera | 1376 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 171065 | |
| A | 102687 | |
| O | 78568 | 7.9% |
| N | 69612 | 7.0% |
| T | 66329 | 6.7% |
| D | 65486 | 6.6% |
| L | 65486 | 6.6% |
| U | 58177 | 5.9% |
| Z | 58177 | 5.9% |
| S | 55208 | 5.6% |
| Other values (10) | 199317 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 973528 | |
| Space Separator | 16584 | 1.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 171065 | |
| A | 102687 | |
| O | 78568 | |
| N | 69612 | 7.2% |
| T | 66329 | 6.8% |
| D | 65486 | 6.7% |
| L | 65486 | 6.7% |
| U | 58177 | 6.0% |
| Z | 58177 | 6.0% |
| S | 55208 | 5.7% |
| Other values (9) | 182733 |
Space Separator
| Value | Count | Frequency (%) |
| 16584 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 973528 | |
| Common | 16584 | 1.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 171065 | |
| A | 102687 | |
| O | 78568 | |
| N | 69612 | 7.2% |
| T | 66329 | 6.8% |
| D | 65486 | 6.7% |
| L | 65486 | 6.7% |
| U | 58177 | 6.0% |
| Z | 58177 | 6.0% |
| S | 55208 | 5.7% |
| Other values (9) | 182733 |
Common
| Value | Count | Frequency (%) |
| 16584 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 990112 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 171065 | |
| A | 102687 | |
| O | 78568 | 7.9% |
| N | 69612 | 7.0% |
| T | 66329 | 6.7% |
| D | 65486 | 6.6% |
| L | 65486 | 6.6% |
| U | 58177 | 5.9% |
| Z | 58177 | 5.9% |
| S | 55208 | 5.6% |
| Other values (10) | 199317 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.198487121 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 8152 |
| Zeros (%) | 6.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 974.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 4 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.383507421 |
|---|---|
| Coefficient of variation (CV) | 0.3295252269 |
| Kurtosis | 3.199436441 |
| Mean | 4.198487121 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -1.843647666 |
| Sum | 523396 |
| Variance | 1.914092784 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 58177 | |
| 4 | 40843 | |
| 0 | 8152 | 6.5% |
| 3 | 7309 | 5.9% |
| 6 | 7056 | 5.7% |
| 2 | 1750 | 1.4% |
| 1 | 1376 | 1.1% |
| Value | Count | Frequency (%) |
| 0 | 8152 | 6.5% |
| 1 | 1376 | 1.1% |
| 2 | 1750 | 1.4% |
| 3 | 7309 | 5.9% |
| 4 | 40843 | |
| 5 | 58177 | |
| 6 | 7056 | 5.7% |
| Value | Count | Frequency (%) |
| 6 | 7056 | 5.7% |
| 5 | 58177 | |
| 4 | 40843 | |
| 3 | 7309 | 5.9% |
| 2 | 1750 | 1.4% |
| 1 | 1376 | 1.1% |
| 0 | 8152 | 6.5% |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 122.2 KiB |
| SOLTERO | |
|---|---|
| CASADO | |
| UNION LIBRE | |
| NO REPORTA | |
| DIVORCIADO | 1872 |
| Other values (2) | 2131 |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 7.813761902 |
| Min length | 5 |
Characters and Unicode
| Total characters | 974087 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | UNION LIBRE |
|---|---|
| 2nd row | SOLTERO |
| 3rd row | SOLTERO |
| 4th row | CASADO |
| 5th row | UNION LIBRE |
Common Values
| Value | Count | Frequency (%) |
| SOLTERO | 60871 | |
| CASADO | 26607 | |
| UNION LIBRE | 25020 | |
| NO REPORTA | 8162 | 6.5% |
| DIVORCIADO | 1872 | 1.5% |
| VIUDO | 1420 | 1.1% |
| SEPARADO | 711 | 0.6% |
Length
Pie chart
| Value | Count | Frequency (%) |
| soltero | 60871 | |
| casado | 26607 | |
| union | 25020 | |
| libre | 25020 | |
| no | 8162 | 5.2% |
| reporta | 8162 | 5.2% |
| divorciado | 1872 | 1.2% |
| viudo | 1420 | 0.9% |
| separado | 711 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 195568 | |
| R | 104798 | |
| E | 94764 | |
| S | 88189 | |
| L | 85891 | |
| T | 69033 | 7.1% |
| A | 64670 | 6.6% |
| N | 58202 | 6.0% |
| I | 55204 | 5.7% |
| 33182 | 3.4% | |
| Other values (6) | 124586 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 940905 | |
| Space Separator | 33182 | 3.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 195568 | |
| R | 104798 | |
| E | 94764 | |
| S | 88189 | |
| L | 85891 | |
| T | 69033 | 7.3% |
| A | 64670 | 6.9% |
| N | 58202 | 6.2% |
| I | 55204 | 5.9% |
| D | 32482 | 3.5% |
| Other values (5) | 92104 |
Space Separator
| Value | Count | Frequency (%) |
| 33182 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 940905 | |
| Common | 33182 | 3.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 195568 | |
| R | 104798 | |
| E | 94764 | |
| S | 88189 | |
| L | 85891 | |
| T | 69033 | 7.3% |
| A | 64670 | 6.9% |
| N | 58202 | 6.2% |
| I | 55204 | 5.9% |
| D | 32482 | 3.5% |
| Other values (5) | 92104 |
Common
| Value | Count | Frequency (%) |
| 33182 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 974087 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 195568 | |
| R | 104798 | |
| E | 94764 | |
| S | 88189 | |
| L | 85891 | |
| T | 69033 | 7.1% |
| A | 64670 | 6.6% |
| N | 58202 | 6.0% |
| I | 55204 | 5.7% |
| 33182 | 3.4% | |
| Other values (6) | 124586 |
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 122.5 KiB |
| A PIE | |
|---|---|
| CONDUCTOR MOTOCICLETA | |
| CONDUCTOR VEHICULO | 5139 |
| PASAJERO BUS | 1782 |
| BICICLETA | 1037 |
| Other values (9) | 4260 |
Length
| Max length | 21 |
|---|---|
| Median length | 5 |
| Mean length | 8.068737316 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1005873 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | A PIE |
|---|---|
| 2nd row | A PIE |
| 3rd row | A PIE |
| 4th row | A PIE |
| 5th row | A PIE |
Common Values
| Value | Count | Frequency (%) |
| A PIE | 96332 | |
| CONDUCTOR MOTOCICLETA | 16113 | 12.9% |
| CONDUCTOR VEHICULO | 5139 | 4.1% |
| PASAJERO BUS | 1782 | 1.4% |
| BICICLETA | 1037 | 0.8% |
| CONDUCTOR TAXI | 1037 | 0.8% |
| PASAJERO MOTOCICLETA | 935 | 0.8% |
| PASAJERO TAXI | 757 | 0.6% |
| NO REPORTA | 717 | 0.6% |
| PASAJERO VEHICULO | 421 | 0.3% |
| Other values (4) | 393 | 0.3% |
Length
| Value | Count | Frequency (%) |
| a | 96332 | |
| pie | 96332 | |
| conductor | 22562 | 9.1% |
| motocicleta | 17048 | 6.9% |
| vehiculo | 5560 | 2.2% |
| pasajero | 4015 | 1.6% |
| bus | 2055 | 0.8% |
| taxi | 1794 | 0.7% |
| bicicleta | 1037 | 0.4% |
| no | 717 | 0.3% |
| Other values (4) | 837 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 124963 | |
| E | 124830 | |
| 123626 | ||
| I | 122808 | |
| P | 101064 | |
| O | 90349 | |
| C | 86855 | |
| T | 60323 | |
| U | 30177 | 3.0% |
| R | 28131 | 2.8% |
| Other values (10) | 112747 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 882247 | |
| Space Separator | 123626 | 12.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 124963 | |
| E | 124830 | |
| I | 122808 | |
| P | 101064 | |
| O | 90349 | |
| C | 86855 | |
| T | 60323 | |
| U | 30177 | 3.4% |
| R | 28131 | 3.2% |
| L | 23645 | 2.7% |
| Other values (9) | 89102 |
Space Separator
| Value | Count | Frequency (%) |
| 123626 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 882247 | |
| Common | 123626 | 12.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 124963 | |
| E | 124830 | |
| I | 122808 | |
| P | 101064 | |
| O | 90349 | |
| C | 86855 | |
| T | 60323 | |
| U | 30177 | 3.4% |
| R | 28131 | 3.2% |
| L | 23645 | 2.7% |
| Other values (9) | 89102 |
Common
| Value | Count | Frequency (%) |
| 123626 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1005873 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 124963 | |
| E | 124830 | |
| 123626 | ||
| I | 122808 | |
| P | 101064 | |
| O | 90349 | |
| C | 86855 | |
| T | 60323 | |
| U | 30177 | 3.0% |
| R | 28131 | 2.8% |
| Other values (10) | 112747 |
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 122.5 KiB |
| A PIE | |
|---|---|
| CONDUCTOR MOTOCICLETA | |
| CONDUCTOR VEHICULO | |
| PASAJERO MOTOCICLETA | 6094 |
| CONDUCTOR TAXI | 2868 |
| Other values (8) | 2905 |
Length
| Max length | 21 |
|---|---|
| Median length | 5 |
| Mean length | 9.256692042 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1153967 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | A PIE |
|---|---|
| 2nd row | A PIE |
| 3rd row | A PIE |
| 4th row | A PIE |
| 5th row | A PIE |
Common Values
| Value | Count | Frequency (%) |
| A PIE | 85850 | |
| CONDUCTOR MOTOCICLETA | 14258 | 11.4% |
| CONDUCTOR VEHICULO | 12688 | 10.2% |
| PASAJERO MOTOCICLETA | 6094 | 4.9% |
| CONDUCTOR TAXI | 2868 | 2.3% |
| PASAJERO BUS | 1454 | 1.2% |
| PASAJERO TAXI | 523 | 0.4% |
| BICICLETA | 333 | 0.3% |
| NO REPORTA | 270 | 0.2% |
| PASAJERO VEHICULO | 152 | 0.1% |
| Other values (3) | 173 | 0.1% |
Length
| Value | Count | Frequency (%) |
| a | 85850 | |
| pie | 85850 | |
| conductor | 29886 | 12.0% |
| motocicleta | 20352 | 8.2% |
| vehiculo | 12840 | 5.2% |
| pasajero | 8323 | 3.3% |
| taxi | 3391 | 1.4% |
| bus | 1526 | 0.6% |
| bicicleta | 333 | 0.1% |
| no | 270 | 0.1% |
| Other values (4) | 372 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 128071 | |
| A | 126845 | |
| 124330 | ||
| I | 123100 | |
| O | 122280 | |
| C | 113982 | |
| P | 94444 | |
| T | 74686 | |
| U | 44253 | 3.8% |
| R | 38851 | 3.4% |
| Other values (10) | 163125 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1029637 | |
| Space Separator | 124330 | 10.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 128071 | |
| A | 126845 | |
| I | 123100 | |
| O | 122280 | |
| C | 113982 | |
| P | 94444 | |
| T | 74686 | |
| U | 44253 | 4.3% |
| R | 38851 | 3.8% |
| L | 33526 | 3.3% |
| Other values (9) | 129599 |
Space Separator
| Value | Count | Frequency (%) |
| 124330 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1029637 | |
| Common | 124330 | 10.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 128071 | |
| A | 126845 | |
| I | 123100 | |
| O | 122280 | |
| C | 113982 | |
| P | 94444 | |
| T | 74686 | |
| U | 44253 | 4.3% |
| R | 38851 | 3.8% |
| L | 33526 | 3.3% |
| Other values (9) | 129599 |
Common
| Value | Count | Frequency (%) |
| 124330 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1153967 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 128071 | |
| A | 126845 | |
| 124330 | ||
| I | 123100 | |
| O | 122280 | |
| C | 113982 | |
| P | 94444 | |
| T | 74686 | |
| U | 44253 | 3.8% |
| R | 38851 | 3.4% |
| Other values (10) | 163125 |
| Distinct | 36 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 123.2 KiB |
| CONTUNDENTES | |
|---|---|
| SIN EMPLEO DE ARMAS | |
| ARMA BLANCA/CORTOPUNZANTE | |
| VEHICULO | |
| ARMA DE FUEGO | |
| Other values (31) |
Length
| Max length | 34 |
|---|---|
| Median length | 13 |
| Mean length | 15.11082679 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1883761 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ARMA BLANCA/CORTOPUNZANTE |
|---|---|
| 2nd row | ARMA BLANCA/CORTOPUNZANTE |
| 3rd row | ARMA BLANCA/CORTOPUNZANTE |
| 4th row | ARMA BLANCA/CORTOPUNZANTE |
| 5th row | ARMA BLANCA/CORTOPUNZANTE |
Common Values
| Value | Count | Frequency (%) |
| CONTUNDENTES | 35333 | |
| SIN EMPLEO DE ARMAS | 33671 | |
| ARMA BLANCA/CORTOPUNZANTE | 20301 | |
| VEHICULO | 13132 | 10.5% |
| ARMA DE FUEGO | 7779 | 6.2% |
| MOTO | 7739 | 6.2% |
| LLAVE MAESTRA | 2368 | 1.9% |
| NO REPORTA | 2107 | 1.7% |
| PALANCAS | 906 | 0.7% |
| ESCOPOLAMINA | 609 | 0.5% |
| Other values (26) | 718 | 0.6% |
Length
| Value | Count | Frequency (%) |
| de | 41460 | |
| contundentes | 35333 | |
| sin | 33671 | |
| empleo | 33671 | |
| armas | 33671 | |
| arma | 28080 | |
| blanca/cortopunzante | 20301 | |
| vehiculo | 13132 | 4.9% |
| fuego | 7779 | 2.9% |
| moto | 7739 | 2.9% |
| Other values (47) | 11580 | 4.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 229081 | |
| N | 204546 | |
| A | 199106 | |
| O | 152020 | 8.1% |
| 141754 | 7.5% | |
| T | 124015 | 6.6% |
| S | 106849 | 5.7% |
| M | 106480 | 5.7% |
| C | 91229 | 4.8% |
| R | 89010 | 4.7% |
| Other values (17) | 439671 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1721679 | |
| Space Separator | 141754 | 7.5% |
| Other Punctuation | 20326 | 1.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 229081 | |
| N | 204546 | |
| A | 199106 | |
| O | 152020 | |
| T | 124015 | 7.2% |
| S | 106849 | 6.2% |
| M | 106480 | 6.2% |
| C | 91229 | 5.3% |
| R | 89010 | 5.2% |
| D | 77291 | 4.5% |
| Other values (13) | 342052 |
Space Separator
| Value | Count | Frequency (%) |
| 141754 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 20326 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1721679 | |
| Common | 162082 | 8.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 229081 | |
| N | 204546 | |
| A | 199106 | |
| O | 152020 | |
| T | 124015 | 7.2% |
| S | 106849 | 6.2% |
| M | 106480 | 6.2% |
| C | 91229 | 5.3% |
| R | 89010 | 5.2% |
| D | 77291 | 4.5% |
| Other values (13) | 342052 |
Common
| Value | Count | Frequency (%) |
| 141754 | ||
| / | 20326 | 12.5% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1883761 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 229081 | |
| N | 204546 | |
| A | 199106 | |
| O | 152020 | 8.1% |
| 141754 | 7.5% | |
| T | 124015 | 6.6% |
| S | 106849 | 5.7% |
| M | 106480 | 5.7% |
| C | 91229 | 4.8% |
| R | 89010 | 4.7% |
| Other values (17) | 439671 |
| Distinct | 69813 |
|---|---|
| Distinct (%) | 57.9% |
| Missing | 4121 |
| Missing (%) | 3.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5462961437 |
| Minimum | 0.001486 |
|---|---|
| Maximum | 300.340824 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 974.1 KiB |
Quantile statistics
| Minimum | 0.001486 |
|---|---|
| 5-th percentile | 0.1079592 |
| Q1 | 0.252911 |
| median | 0.400576 |
| Q3 | 0.620965 |
| 95-th percentile | 1.5417977 |
| Maximum | 300.340824 |
| Range | 300.339338 |
| Interquartile range (IQR) | 0.368054 |
Descriptive statistics
| Standard deviation | 1.013872215 |
|---|---|
| Coefficient of variation (CV) | 1.855902201 |
| Kurtosis | 63421.97897 |
| Mean | 0.5462961437 |
| Median Absolute Deviation (MAD) | 0.172421 |
| Skewness | 214.9790344 |
| Sum | 65851.62976 |
| Variance | 1.027936869 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.198907 | 1799 | 1.4% |
| 1.2112 | 1375 | 1.1% |
| 0.270129 | 1164 | 0.9% |
| 0.419048 | 1073 | 0.9% |
| 0.195689 | 1004 | 0.8% |
| 1.016831 | 842 | 0.7% |
| 0.155217 | 804 | 0.6% |
| 0.483664 | 768 | 0.6% |
| 0.210898 | 761 | 0.6% |
| 0.547459 | 703 | 0.6% |
| Other values (69803) | 110249 | |
| (Missing) | 4121 | 3.3% |
| Value | Count | Frequency (%) |
| 0.001486 | 1 | |
| 0.002024 | 1 | |
| 0.002419 | 1 | |
| 0.002562 | 1 | |
| 0.003135 | 1 | |
| 0.003334 | 1 | |
| 0.003703 | 1 | |
| 0.004026 | 1 | |
| 0.004032 | 1 | |
| 0.004148 | 1 |
| Value | Count | Frequency (%) |
| 300.340824 | 1 | |
| 11.507579 | 1 | |
| 9.725923 | 1 | |
| 9.684324 | 1 | |
| 8.108664 | 1 | |
| 8.100949 | 1 | |
| 7.989606 | 1 | |
| 7.322976 | 1 | |
| 6.806946 | 1 | |
| 6.528346 | 1 |
| Distinct | 36 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 123.2 KiB |
| CAI CAFÉ MADRID | |
|---|---|
| CAI LA ESPERANZA | 7269 |
| CAI SAN FRANCISCO | 6702 |
| CAI LA CONCORDIA | 6111 |
| CAI VIADUCTO | 6002 |
| Other values (31) |
Length
| Max length | 34 |
|---|---|
| Median length | 15 |
| Mean length | 15.83666365 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1974246 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CAI CAFÉ MADRID |
|---|---|
| 2nd row | CAI CENTENARIO |
| 3rd row | CAI CENTENARIO |
| 4th row | CAI CAFÉ MADRID |
| 5th row | CAI CAFÉ MADRID |
Common Values
| Value | Count | Frequency (%) |
| CAI CAFÉ MADRID | 11816 | 9.5% |
| CAI LA ESPERANZA | 7269 | 5.8% |
| CAI SAN FRANCISCO | 6702 | 5.4% |
| CAI LA CONCORDIA | 6111 | 4.9% |
| CAI VIADUCTO | 6002 | 4.8% |
| CAI KENNEDY | 5634 | 4.5% |
| CAI INEM | 5066 | 4.1% |
| CAI SAN ALONSO | 4590 | 3.7% |
| CAI CENTENARIO | 4321 | 3.5% |
| NO REPORTA | 4121 | 3.3% |
| Other values (26) | 63031 |
Length
| Value | Count | Frequency (%) |
| cai | 100619 | |
| la | 20312 | 5.8% |
| de | 18007 | 5.1% |
| san | 13503 | 3.9% |
| madrid | 11816 | 3.4% |
| café | 11816 | 3.4% |
| estación | 9976 | 2.8% |
| policía | 9976 | 2.8% |
| policia | 9947 | 2.8% |
| santander | 8358 | 2.4% |
| Other values (42) | 135830 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 303930 | |
| 225497 | ||
| I | 218378 | |
| C | 210764 | |
| O | 145219 | |
| N | 124845 | 6.3% |
| E | 115016 | 5.8% |
| R | 114179 | 5.8% |
| S | 89559 | 4.5% |
| D | 77397 | 3.9% |
| Other values (20) | 349462 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1741384 | |
| Space Separator | 225497 | 11.4% |
| Decimal Number | 3998 | 0.2% |
| Dash Punctuation | 3367 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 303930 | |
| I | 218378 | |
| C | 210764 | |
| O | 145219 | |
| N | 124845 | |
| E | 115016 | 6.6% |
| R | 114179 | 6.6% |
| S | 89559 | 5.1% |
| D | 77397 | 4.4% |
| T | 71218 | 4.1% |
| Other values (16) | 270879 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 1999 | |
| 2 | 1999 |
Space Separator
| Value | Count | Frequency (%) |
| 225497 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3367 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1741384 | |
| Common | 232862 | 11.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 303930 | |
| I | 218378 | |
| C | 210764 | |
| O | 145219 | |
| N | 124845 | |
| E | 115016 | 6.6% |
| R | 114179 | 6.6% |
| S | 89559 | 5.1% |
| D | 77397 | 4.4% |
| T | 71218 | 4.1% |
| Other values (16) | 270879 |
Common
| Value | Count | Frequency (%) |
| 225497 | ||
| - | 3367 | 1.4% |
| 4 | 1999 | 0.9% |
| 2 | 1999 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1940267 | |
| Latin 1 Sup | 33979 | 1.7% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 303930 | |
| 225497 | ||
| I | 218378 | |
| C | 210764 | |
| O | 145219 | |
| N | 124845 | 6.4% |
| E | 115016 | 5.9% |
| R | 114179 | 5.9% |
| S | 89559 | 4.6% |
| D | 77397 | 4.0% |
| Other values (16) | 315483 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| É | 11816 | |
| Ó | 9976 | |
| Í | 9976 | |
| Ñ | 2211 | 6.5% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| CRIMEN_ID | FECHA | AÑO | MES | MES_num | DIA | DIA_SEMANA | DIA_SEMANA_num | LATITUD | LONGITUD | ZONA | COMUNA | COMUNA_num | BARRIO | UNIDAD_ESPACIAL | TIPO_DELITO_ARTICULO | TIPO_DELITO | TIPO_CONDUCTA | TIPO_LESION | GENERO_VICTIMA | EDAD_VICTIMA | GRUPO_ETARIO_VICTIMA | GRUPO_ETARIO_VICTIMA_num | ESTADO_CIVIL_VICTIMA | MEDIO_TRANSPORTE_VICTIMA | MEDIO_TRANSPORTE_VICTIMARIO | TIPO_ARMA | DISTANCIA_ESTACION_POLICIA_CERCANA | ESTACION_POLICIA_CERCANA | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2010-01-01 | 2010 | ENERO | 1 | 1 | VIERNES | 5 | 7.170557 | -73.135108 | URBANA | MORRORICO | 14 | BUENOS AIRES | CORREGIMIENTO 1 | ARTÍCULO 111 | LESIONES PERSONALES | LESIONES PERSONALES | LESIONES NO FATALES | MASCULINO | 30 | ADULTEZ | 5 | UNION LIBRE | A PIE | A PIE | ARMA BLANCA/CORTOPUNZANTE | 1.211200 | CAI CAFÉ MADRID |
| 1 | 2 | 2010-01-01 | 2010 | ENERO | 1 | 1 | VIERNES | 5 | 7.120645 | -73.126050 | URBANA | GARCÍA ROVIRA | 5 | CAMPO HERMOSO | CENTRO | ARTÍCULO 111 | LESIONES PERSONALES | LESIONES PERSONALES | LESIONES NO FATALES | MASCULINO | 21 | JOVENES | 4 | SOLTERO | A PIE | A PIE | ARMA BLANCA/CORTOPUNZANTE | 0.195689 | CAI CENTENARIO |
| 2 | 3 | 2010-01-01 | 2010 | ENERO | 1 | 1 | VIERNES | 5 | 7.120645 | -73.126050 | URBANA | GARCÍA ROVIRA | 5 | CAMPO HERMOSO | CENTRO | ARTÍCULO 111 | LESIONES PERSONALES | LESIONES PERSONALES | LESIONES NO FATALES | MASCULINO | 23 | JOVENES | 4 | SOLTERO | A PIE | A PIE | ARMA BLANCA/CORTOPUNZANTE | 0.195689 | CAI CENTENARIO |
| 3 | 4 | 2010-01-01 | 2010 | ENERO | 1 | 1 | VIERNES | 5 | 7.151359 | -73.145705 | URBANA | SAN FRANCISCO | 3 | COMUNEROS | VILLAS DE SAN IGNACIO | ARTÍCULO 111 | LESIONES PERSONALES | LESIONES PERSONALES | LESIONES NO FATALES | MASCULINO | 36 | ADULTEZ | 5 | CASADO | A PIE | A PIE | ARMA BLANCA/CORTOPUNZANTE | 1.230792 | CAI CAFÉ MADRID |
| 4 | 5 | 2010-01-01 | 2010 | ENERO | 1 | 1 | VIERNES | 5 | 7.170557 | -73.135108 | URBANA | OCCIDENTAL | 4 | GIRARDOT | CORREGIMIENTO 1 | ARTÍCULO 111 | LESIONES PERSONALES | LESIONES PERSONALES | LESIONES NO FATALES | MASCULINO | 20 | JOVENES | 4 | UNION LIBRE | A PIE | A PIE | ARMA BLANCA/CORTOPUNZANTE | 1.211200 | CAI CAFÉ MADRID |
| 5 | 6 | 2010-01-01 | 2010 | ENERO | 1 | 1 | VIERNES | 5 | 7.170557 | -73.135108 | URBANA | OCCIDENTAL | 4 | GIRARDOT | CORREGIMIENTO 1 | ARTÍCULO 111 | LESIONES PERSONALES | LESIONES PERSONALES | LESIONES NO FATALES | MASCULINO | 20 | JOVENES | 4 | UNION LIBRE | A PIE | A PIE | ARMA BLANCA/CORTOPUNZANTE | 1.211200 | CAI CAFÉ MADRID |
| 6 | 7 | 2010-01-01 | 2010 | ENERO | 1 | 1 | VIERNES | 5 | 7.187455 | -73.131727 | URBANA | NOR ORIENTAL | 2 | LOS ANGELES | VILLA LUZ | ARTÍCULO 111 | LESIONES PERSONALES | LESIONES PERSONALES | LESIONES NO FATALES | MASCULINO | 28 | JOVENES | 4 | CASADO | A PIE | A PIE | ARMA BLANCA/CORTOPUNZANTE | 3.049477 | CAI CAFÉ MADRID |
| 7 | 8 | 2010-01-01 | 2010 | ENERO | 1 | 1 | VIERNES | 5 | 7.156554 | -73.140753 | URBANA | OCCIDENTAL | 4 | NARIÑO | VILLAS DE SAN IGNACIO | ARTÍCULO 111 | LESIONES PERSONALES | LESIONES PERSONALES | LESIONES NO FATALES | MASCULINO | 42 | ADULTEZ | 5 | SOLTERO | A PIE | A PIE | ARMA BLANCA/CORTOPUNZANTE | 0.572691 | CAI CAFÉ MADRID |
| 8 | 9 | 2010-01-01 | 2010 | ENERO | 1 | 1 | VIERNES | 5 | 7.120001 | -73.116084 | URBANA | PROVENZA | 10 | PROVENZA | BOLIVAR | ARTÍCULO 111 | LESIONES PERSONALES | LESIONES PERSONALES | LESIONES NO FATALES | MASCULINO | 22 | JOVENES | 4 | SOLTERO | A PIE | A PIE | ARMA BLANCA/CORTOPUNZANTE | 0.466600 | POLICIA SANTANDER SIC-SIP |
| 9 | 10 | 2010-01-01 | 2010 | ENERO | 1 | 1 | VIERNES | 5 | 7.161314 | -73.139957 | URBANA | CABECERA DEL LLANO | 12 | SOTOMAYOR | CAFE MADRID | ARTÍCULO 111 | LESIONES PERSONALES | LESIONES PERSONALES | LESIONES NO FATALES | MASCULINO | 20 | JOVENES | 4 | SOLTERO | A PIE | A PIE | ARMA BLANCA/CORTOPUNZANTE | 0.177544 | CAI CAFÉ MADRID |
Last rows
| CRIMEN_ID | FECHA | AÑO | MES | MES_num | DIA | DIA_SEMANA | DIA_SEMANA_num | LATITUD | LONGITUD | ZONA | COMUNA | COMUNA_num | BARRIO | UNIDAD_ESPACIAL | TIPO_DELITO_ARTICULO | TIPO_DELITO | TIPO_CONDUCTA | TIPO_LESION | GENERO_VICTIMA | EDAD_VICTIMA | GRUPO_ETARIO_VICTIMA | GRUPO_ETARIO_VICTIMA_num | ESTADO_CIVIL_VICTIMA | MEDIO_TRANSPORTE_VICTIMA | MEDIO_TRANSPORTE_VICTIMARIO | TIPO_ARMA | DISTANCIA_ESTACION_POLICIA_CERCANA | ESTACION_POLICIA_CERCANA | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 124653 | 124654 | 2021-02-26 | 2021 | FEBRERO | 2 | 26 | VIERNES | 5 | 7.133930 | -73.126930 | URBANA | SAN FRANCISCO | 3 | SAN FRANCISCO | MUTUALIDAD | ARTÍCULO 239 | HURTO A MOTOCICLETAS | NO REPORTA | LESIONES NO FATALES | MASCULINO | 41 | ADULTEZ | 5 | CASADO | A PIE | A PIE | SIN EMPLEO DE ARMAS | 0.413128 | CAI SAN FRANCISCO |
| 124654 | 124655 | 2021-02-27 | 2021 | FEBRERO | 2 | 27 | SÁBADO | 6 | 7.091319 | -73.117514 | URBANA | PROVENZA | 10 | SAN LUIS | SAN LUIS | ARTÍCULO 239 | HURTO A MOTOCICLETAS | NO REPORTA | LESIONES NO FATALES | MASCULINO | 20 | JOVENES | 4 | SOLTERO | A PIE | A PIE | SIN EMPLEO DE ARMAS | 0.661552 | CAI SUR |
| 124655 | 124656 | 2021-02-10 | 2021 | FEBRERO | 2 | 10 | MIÉRCOLES | 3 | 7.153306 | -73.136682 | URBANA | NORTE | 1 | TEJAR NORTE (SECTOR II ) | KENNEDY | ARTÍCULO 239 | HURTO A MOTOCICLETAS | NO REPORTA | LESIONES NO FATALES | MASCULINO | 21 | JOVENES | 4 | SOLTERO | A PIE | A PIE | SIN EMPLEO DE ARMAS | 0.354619 | CAI KENNEDY |
| 124656 | 124657 | 2021-02-09 | 2021 | FEBRERO | 2 | 9 | MARTES | 2 | 7.098716 | -73.132870 | URBANA | MUTIS | 17 | URB. BRISAS DEL MUTIS | MUTIS | ARTÍCULO 239 | HURTO A MOTOCICLETAS | NO REPORTA | LESIONES NO FATALES | MASCULINO | 27 | JOVENES | 4 | SOLTERO | A PIE | A PIE | SIN EMPLEO DE ARMAS | 0.206512 | CAI MUTIS |
| 124657 | 124658 | 2021-02-11 | 2021 | FEBRERO | 2 | 11 | JUEVES | 4 | 7.138767 | -73.072906 | RURAL | CORREGIMIENTO 3 | 0 | VDA. RETIRO CHIQUITO | CORREGIMIENTO 3 | ARTÍCULO 239 | HURTO A MOTOCICLETAS | NO REPORTA | LESIONES NO FATALES | FEMENINO | 38 | ADULTEZ | 5 | SOLTERO | A PIE | A PIE | SIN EMPLEO DE ARMAS | 3.511822 | CAI MORRORICO |
| 124658 | 124659 | 2021-02-20 | 2021 | FEBRERO | 2 | 20 | SÁBADO | 6 | 7.092583 | -73.106933 | URBANA | SUR | 11 | VILLA INES | DIAMANTE I | ARTÍCULO 239 | HURTO A MOTOCICLETAS | NO REPORTA | LESIONES NO FATALES | MASCULINO | 33 | ADULTEZ | 5 | SOLTERO | A PIE | A PIE | SIN EMPLEO DE ARMAS | 0.592530 | CAI VIADUCTO |
| 124659 | 124660 | 2021-02-07 | 2021 | FEBRERO | 2 | 7 | DOMINGO | 7 | 7.148965 | -73.115512 | RURAL | CORREGIMIENTO 1 | 0 | VRDA ABEJAS | CORREGIMIENTO 3 | ARTÍCULO 239 | HURTO A MOTOCICLETAS | NO REPORTA | LESIONES NO FATALES | MASCULINO | 35 | ADULTEZ | 5 | SOLTERO | A PIE | A PIE | SIN EMPLEO DE ARMAS | 1.247678 | CAI LA ESPERANZA |
| 124660 | 124661 | 2021-02-04 | 2021 | FEBRERO | 2 | 4 | JUEVES | 4 | 7.094033 | -73.132868 | URBANA | MUTIS | 17 | MUTIS | MONTERREDONDO | ARTÍCULO 239 | HURTO A AUTOMOTORES | NO REPORTA | LESIONES NO FATALES | FEMENINO | 29 | ADULTEZ | 5 | UNION LIBRE | A PIE | A PIE | SIN EMPLEO DE ARMAS | 0.693896 | CAI MUTIS |
| 124661 | 124662 | 2021-02-03 | 2021 | FEBRERO | 2 | 3 | MIÉRCOLES | 3 | 7.078868 | -73.117173 | URBANA | SUR | 11 | URB. BRISAS DE PROVENZA | BRISAS DE PROVENZA | ARTÍCULO 239 | HURTO A AUTOMOTORES | NO REPORTA | LESIONES NO FATALES | MASCULINO | 34 | ADULTEZ | 5 | UNION LIBRE | A PIE | A PIE | SIN EMPLEO DE ARMAS | 0.718624 | CAI INEM |
| 124662 | 124663 | 2021-02-05 | 2021 | FEBRERO | 2 | 5 | VIERNES | 5 | 7.097263 | -73.125917 | URBANA | LA CIUDADELA | 7 | URB. CIUDAD BOLIVAR | REAL DE MINAS | ARTÍCULO 239 | HURTO A AUTOMOTORES | NO REPORTA | LESIONES NO FATALES | MASCULINO | 29 | ADULTEZ | 5 | SOLTERO | A PIE | A PIE | SIN EMPLEO DE ARMAS | 0.198304 | CAI REAL DE MINAS |